Search CORE

4 research outputs found

Performance Improvement of Distributed Computing Framework and Scientific Big Data Analysis

Author: Kondikoppa Praveenkumar
Publication venue: LSU Digital Commons
Publication date: 01/01/2014
Field of study

Analysis of Big data to gain better insights has been the focus of researchers in the recent past. Traditional desktop computers or database management systems may not be suitable for efficient and timely analysis, due to the requirement of massive parallel processing. Distributed computing frameworks are being explored as a viable solution. For example, Google proposed MapReduce, which is becoming a de facto computing architecture for Big data solutions. However, scheduling in MapReduce is coarse grained and remains as a challenge for improvement. Related with MapReduce scheduler when configured over distributed clusters, we identify two issues: data locality disruption and random assignment of non-local map tasks. We propose a network aware scheduler to extend the existing rack awareness. The tasks are scheduled in the order of node, rack and any other rack within the same cluster to achieve cluster level data locality. The issue of random assignment non-local map tasks is handled by enhancing the scheduler to consider the network parameters, such as delay, bandwidth and packet loss between remote clusters. As part of Big data analysis at computational biology, we consider two major data intensive applications: indexing genome sequences and de Novo assembly. Both of these applications deal with the massive amount data generated from DNA sequencers. We developed a scalable algorithm to construct sub-trees of a suffix tree in parallel to address huge memory requirements needed for indexing the human genome. For the de Novo assembly, we propose Parallel Giraph based Assembler (PGA) to address the challenges associated with the assembly of large genomes over commodity hardware. PGA uses the de Bruijn graph to represent the data generated from sequencers. Huge memory demands and performance expectations are addressed by developing parallel algorithms based on the distributed graph-processing framework, Apache Giraph

Louisiana State University

Exploring parallelism and desynchronization of TCP over high speed networks with tiny buffers

Author: Chiu Chui-Hui
Cui Cheng
Kondikoppa Praveenkumar
Park Seung-Jong
Xue Lin
Publication venue: LSU Digital Commons
Publication date: 01/09/2015
Field of study

Louisiana State University

Towards fair and low latency next generation high speed networks: AFCD queuing

Author: Cheng Cui
Chui-Hui Chiu
Floyd
Gettys
Ha
Hollot
Jeyakumar
Kleinrock
Lin Xue
Nichols
Pan
Perry
Praveenkumar Kondikoppa
Raina
Rosolen
Seung-Jong Park
Shreedhar
Stoica
Suman Kumar
Yin
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Exploring parallelism and desynchronization of TCP over high speed networks with tiny buffers

Author: Alizadeh
Alizadeh
Beheshti
Brakmo
Cai
Cheng Cui
Chui-Hui Chiu
Cui
Enachescu
Enachescu
Floyd
Ford
Fu
Ha
Ha
Hacker
Hassayoun
Hassayoun
Honda
Jain
Jin
Khalili
Lin Xue
Liu
Park
Praveenkumar Kondikoppa
Raiciu
Raiciu
Raina
Raina
Seung-Jong Park
Shihada
Sivakumar
Tan
Villamizar
Vishwanath
Wischik
Wischik
Xue
Xue
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref